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Abstract — Resource allocation is investigated for fading relay 
channels under separate power constraints at the source and 
relay nodes. As a basic information-theoretic model for fading 
relay channels, the parallel relay channel is first studied, which 
consists of multiple independent three-terminal relay channels 
as subchannels. Lower and upper bounds on the capacity are 
derived, and are shown to match, and thus establish the capacity 
for the parallel relay channel with degraded subchannels. This 
capacity theorem is further demonstrated via the Gaussian 
parallel relay channel with degraded subchannels, for which the 
synchronized and asynchronized capacities are obtained. The 
capacity achieving power allocation at the source and relay 
nodes among the subchannels is partially characterized for the 
synchronized case and fully characterized for the asynchronized 
case. The fading relay channel is then studied, which is based 
on the three-terminal relay channel with each communication 
link being corrupted by a multiplicative fading gain coefficient as 
well as an additive Gaussian noise term. For each link, the fading 
state information is assumed to be known at both the transmitter 
and the receiver. The source and relay nodes are allowed to 
allocate their power adaptively according to the instantaneous 
channel state information. The source and relay nodes are 
assumed to be subject to separate power constraints. For both 
the full-duplex and half-duplex cases, power allocations that 
maximize the achievable rates are obtained. In the half-duplex 
case, the power allocation needs to be jointly optimized with the 
channel resource (time and bandwidth) allocation between the 
two orthogonal channels over which the relay node transmits and 
receives. Capacities are established for fading relay channels that 
satisfy certain conditions. 

Index Terms — Capacity, max-min, parallel relay channels, 
resource allocation, wireless relay channels. 

I. Introduction 

The three-terminal relay channel was introduced by van 
der Meulen [1] and was initially studied primarily in the 
context of multiuser information theory [1], [2], [3]. In re- 
cent years, relaying has emerged as a powerful technique to 
improve the reliability and throughput of wireless networks. 
An understanding of wireless relay channels has thus become 
an important area of research. Wireless relay channels and 
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networks have been addressed from various aspects, including 
information-theoretic capacity [4], [5], [6], [7], [8], [9], [10], 
[11], [12], [13], [14], [15], [16], [17], [18], [19], [20], [21], 
diversity [22], [23], [24], [25], outage performance [26], [27], 
and cooperative coding [28], [29], [30]. Central to the study of 
wireless relay channels is the problem of resource allocation. 
For example, the source and relay nodes can dynamically 
allocate their transmit powers to achieve a better rate if the 
fading state information is available. Resource allocation for 
relay channels and networks has been studied by several recent 
papers, including [9], [31], [32], [33], [34], [26]. Common to 
all of these studies is the assumption that the source and relay 
nodes are subject to a total power constraint. 

In this paper, we study wireless fading relay channels, where 
we assume that the source and relay nodes are subject to 
separate power constraints instead of a total power constraint. 
This assumption is more practical for wireless networks, 
because the source and relay nodes are usually geographically 
separated, and are hence supported by separate power supplies. 
Under this assumption, the resource allocation problem falls 
under a class of max-min problems. We connect such max-min 
problems to the minimax two hypothesis testing problem (see, 
e.g., [35, II.C]), and apply a similar technique to find optimal 
(in the max-min sense) resource allocation strategies for fading 
relay channels. 

We first study the parallel relay channel, which consists 
of multiple independent relay channels and serves as a basic 
information-theoretic model for fading relay channels. We 
derive a lower bound on the capacity based on the partial 
decode-and-forward scheme as well as a cut-set upper bound. 
We show that the two bounds match and establish the capacity 
for the parallel relay channel with degraded subchannels. This 
generalizes the capacity result in [36, Th. 12] to multiple sub- 
channels. We also demonstrate that the parallel relay channel is 
not a simple combination of subchannels in that the capacity 
of the parallel relay channel can be larger than the sum of 
the capacities of subchannels, as was also remarked in [36, 
Sec. VII]. 

We then study the Gaussian parallel relay channel with 
degraded subchannels. There are two types of capacity that 
can be defined for this channel. The first is the synchronized 
capacity, where the source and relay inputs are allowed to 
be correlated. To achieve the capacity, the source and relay 
nodes need to choose an optimal correlation parameter for each 
subchannel, and further to choose an optimal power allocation 
across the subchannels under separate power constraints. We 
characterize the optimal solutions for the cases where the 
optimization is convex, and provide equations that need to 
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be solved numerically for cases where the optimization is 
nonconvex. We also study the asynchronized capacity, where 
the source and relay inputs are required to be independent. 
This capacity is easier to achieve in practice due to the simpler 
transceiver design for the source and relay nodes. We fully 
characterize the capacity-achieving power allocation at the 
source and relay nodes in closed form. 

We then move on to study the fading relay channel, which 
is based on the classical relay channel with each transmission 
link being corrupted by a multiplicative stationary and ergodic 
fading process as well as an additive white Gaussian noise 
process. The fading relay channel is a special case of the 
parallel relay channel, with each subchannel corresponding to 
one fading state realization. We assume that both the trans- 
mitter and the receiver know the channel state information, so 
that the source and relay nodes can allocate their transmission 
powers adaptively according to the instantaneous fading state 
information. We consider the resource allocation problem for 
two fading relay models: full-duplex and half-duplex. 

The fading full-duplex relay channel has been studied in 
[9], where lower and upper bounds on the capacity were 
derived, along with the resource allocation that optimizes these 
bounds, under a total power constraint for the source and relay 
nodes. In this paper, we assume separate power constraints 
for the source and relay nodes and study the power allocation 
that optimizes the capacity bounds. We focus on the more 
practical asynchronized case. We obtain the power allocation 
that maximizes an achievable rate, and show that the optimal 
power allocation may be two-level water-filling, orthogonal 
division water-filling, or iterative water-filling depending on 
the channel statistics and the power constraints. We also 
establish the asynchronized capacity for channels that satisfy 
a certain condition. 

We further study a fading half-duplex relay channel model, 
where the source node transmits to the relay and destination 
nodes in one channel, and the relay node transmits to the 
destination node in an orthogonal channel. We introduce a 
parameter 9 to represent the channel resource (time and band- 
width) allocation between the two orthogonal channels. We 
study three scenarios. In Scenario I, where the two orthogonal 
channels share the channel resource equally, i.e., 9 = 1/2, 
we show that the optimal power allocation falls into three 
cases depending on the ranges of power constraints at the 
source and relay nodes. The optimal power allocation for the 
relay node is always water-filling, but the power allocation for 
the source node is not water-filling in general. In scenario II, 
the channel resource allocation parameter 9 needs to be same 
for all channel states but can be jointly optimized with the 
power allocation. In Scenario III, which is the most general 
scenario, 9 can change with channel realizations and is jointly 
optimized with power allocation. For both Scenarios II and 
III, we derive the jointly optimal 9 and power allocation that 
maximize the achievable rate. Furthermore, we show that the 
lower bound achieves the cut-set upper bound if the channel 
statistics and power constraint satisfy a certain condition. 
We hence establish the capacity for these channels over all 
possible power and channel resource allocations. 

The paper is organized as follows. In Section[II] the parallel 



relay channel is introduced and studied. In Section [TTT1 the 
optimal resource allocation that achieves the capacity for the 
Gaussian parallel relay channel with degraded subchannels 
is studied. In Section IIVI resource allocation for the fading 
full-duplex relay channel is presented. In Section |VJ resource 
allocation for the fading half-duplex relay channel is studied, 
where the three scenarios described above are considered. 
Finally in Section [VT] we give concluding remarks. 



II. Parallel Relay Channels 

In this section, we study the parallel relay channel, which 
serves as a basic information-theoretic model for the fading 
relay channels that are considered in Sections [IV] and [V] The 
parallel relay channel also models the relay channel where the 
source and relay nodes can transmit over multiple frequency 
bands with each subchannel corresponding to the channel over 
one frequency band. It is shown in this section that in contrast 
to the parallel point-to-point channel, the parallel relay channel 
is not a simple combination of independent subchannels. 

Definition 1: A parallel relay channel with K subchannels 
(see Fig. [TJ consists of K finite source input alphabets 
X\, . . . , Xk, K finite relay input alphabets X R \, . . . , X R k, 
K finite destination output alphabets y-y , . . . , y>K and K 
finite relay output alphabets y R i , ■ ■ ■ , yRK ■ The transition 
probability distribution is given by 

K 

J\pk{Vk,yRk\xk,XBk) (l) 

fe=l 

where x k e X k , x Rk <E X Rk , y k <E y k , and y Rk S y Rk for 
k = 1,...,K. 

A (2 nR ,n) code consists of the following: 

• One message set W = {1, 2, . . . , 2 nR } with the message 
W uniformly distributed over W; 

• One encoder at the source node that maps each message 
w 6 W to a codeword 

{X\\ j . . . , %\ny • • • 3 ^KX 3 • • • 3 -^ifn) 3 

• A set of relay functions {/i}™ =1 such that for 1 < i < n: 

(XRU, ■ ■ ■ , XRKi) 

= fiiVKlU ■ ■ ■ >tol[i-l]3 ■ ■ ■ iVRKl, ■ ■ ■ ,yRK[i-l])', 

• One decoder at the destination node that maps a received 
sequence (y n , y ln , y K1 , y Kn ) to a message 
w e W. 

Note that the relay node is allowed to jointly encode and 
decode across the K parallel subchannels. 

A rate R is achievable if there exists a sequence of (2 nR , nj 
codes with the average probability of error at the destination 
node going to zero as n goes to infinity. 

The following theorem provides lower and upper bounds on 
the capacity of the parallel relay channel. 
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Fig. 1. Parallel relay channel 



Theorem 1: For the parallel relay channel, a lower bound 
on the capacity is given by 



A' 



Ciow = max min < 2J I( x k , X Rk ;Y k ), 

{ k=l 

HQk;Y Rk \X Rk ) + I(X k ;Y k \Q k , X Rk )\ 

k=l ) 



(2) 



where Q k for k = 1, . . . , K are auxiliary random variables. 
The maximum in (O is over the joint distribution 



K 



Y\_p k (qk,x Rk , x k )p k (y k , y Rk \x k , x Rk ) . 



fe=i 



An upper bound on the capacity is given by 

C up = max min I ^ I( X k, X Rk ;Y k ), 



k=l 

K 



(3) 



J2l(Xk;Y k ,Y Rk \X Rk ) 



k=l 



where the maximum in (01 is over the joint distribution 



K 

Y\_Pk(x Rk ,x k )p k (y k ,y Rk \x k ,x Rk ). 

fc=i 

Remark 1: The lower bound ([2]l generalizes the rate given 
in [37, Theorem 1] based on the decode-and-forward scheme. 

Proof: To derive the lower bound iff}, we use the 
following achievable rate for the relay channel based on the 
partial decode-and-forward scheme given in [3]: 



R < max min 



[l(X R , X; Y), I(Q; Y R \X R ) + I(X; Y\Q, X R )} 



(4) 



We set Q = (Q U ...,Q K ), X = (X X ,...,X K ), 
X R = (X R1 , Xrk), Y = (Yi, . . . , Y K ), and Y R = 
(Y R i , . . . , Y R k) in the above achievable rate. We further 
choose (Qi,Xi,X R i), . . ., (Qk,Xk,X R k) to be indepen- 
dent, and then obtain the lower bound (fJJ. 

The upper bound (O is based on the cut-set bound [2, The- 
orem 4] and the independency of the K parallel subchannels. 



Remark 2: In the achievable scheme, the relay node first 
decodes information sent by the source node over each sub- 
channel. The relay node then reassigns total decoded infor- 
mation to each subchannel to forward to the destination node. 
Hence information that was sent to the relay node over one 
subchannel may be forwarded to the destination node over 
other subchannels, as long as the total rate at which the relay 
node can forward information to the destination node over all 
subchannels is larger than the total rate at which the relay node 
can decode information from the source node. 

The lower and upper bounds in Theorem [T] do not match in 
general. We next study a class of parallel relay channels with 
degraded subchannels. For this channel, the lower and upper 
bounds match, and we hence establish the capacity. Moreover, 
this capacity provides an achievable rate for the case where 
the subchannels are either stochastically degraded or reversely 
degraded (e.g., fading relay channels). 

Definition 2: Consider the parallel relay channel with de- 
graded subchannels. Assume each subchannel is either de- 
graded or reversely degraded, i.e., each subchannel satisfies 
either 



Pk(yk,ynk\xk,x Rk ) 

= Pk (yRk \x k ,x Rk )p k (y k \ y Rk , x Rk ) 



(5) 



or 



(6) 



Pk{yk,yRk\xk,x Rk ) 

= Pk(yk\xk,x Rk )p k (y Rk \y k ,x Rk ). 
We note that the parallel relay channel with degraded 
subchannels has been studied in [36, Sec. VII] for the two- 
subchannel case. We now generalize the result in [36, Sec. VII] 
to channels with multiple subchannels. In fact, our main focus 
is on the Gaussian case considered in this section and Section 
M 

We define the set A to contain the indices of the subchannels 
that satisfy (O, i.e., those subchannels where the source-to- 
relay channel is stronger than the source-to-destination chan- 
nel. Then the set A c contains the indices of the subchannels 
that satisfy ©, i.e., those subchannels where the source-to- 
relay channel is weaker than the source-to-destination channel. 
Note that in general the parallel relay channel with degraded 
subchannels is neither a degraded relay channel nor a reversely 
degraded channel. For this channel, the lower and upper 
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bounds given in Theorem Q] match and establish the following 
capacity theorem. 

Theorem 2: For the parallel relay channel with degraded 
subchannels, the capacity is given by 



C = max min I ^ I(X k , X Rk \Y k ), 

I k=l 

Y J l{X k -Y Rk \X Rk )+ I(X k ;Y k \X Rk )\. 

keA keA" ) 

where the maximum is over the joint distribution 



(7) 



K 



Y\_ Pk(x Rk ,X k )pk(yk, URk\x k ,X Rk ). 

fe=l 

Remark 3: Theorem |2] generalizes the capacity of the par- 
allel relay channel with unmatched degraded subchannels in 
[36, Theorem 12] to channels with multiple subchannels. 

Proof: The achievability follows from C\ ov in (fJJ by 
setting Q k = X k for k S A and setting Q k = <f> for k E 
A c . The converse follows from C up in (0 by applying the 
degradedness conditions ((5J and ([6j. ■ 

Note that the partial decode-and-forward scheme achieves 
the capacity of the parallel relay channel with degraded sub- 
channels. From the selection of Q k in the above achievability 
proof, it can be seen that the relay node decodes all the 
information sent over the degraded subchannels, i.e., Q k = X k 
for keA, and decodes no information sent over the reversely 
degraded subchannels, i.e., Q k = cf> for k £ A c . Hence for the 
subchannels with k 6 A c , the link from the source node to the 
relay node can be eliminated without changing the capacity of 
the channel. 

However, the relay node still plays an important role in the 
reversely degraded subchannels by forwarding information that 
it has decoded in other degraded subchannels to the destination 
node. This is different from the role of the relay node in a 
single reversely degraded channel, where it does not forward 
information at all. Furthermore, we see that in the parallel 
relay channel, information may be transmitted from the source 
node to the relay node in one subchannel, and be forwarded 
to the destination node over other subchannels, as we have 
commented in Remark |2] More importantly, in contrast to 
the parallel point-to-point channel, the capacity of the parallel 
relay channel with degraded subchannels in Theorem [2] can 
be larger than the following sum of the capacities of the 
subchannels 

max min j £ I(X k , X Rk] Y k ), £ I(X k ; Y Rk \X Rk ) 1 

[keA keA ) 



Y, I(X k ;Y k \X Rk ) 



keA" 



(8) 



This demonstrates that the parallel relay channel is not a 
simple combination of independent subchannels. This fact has 
also been pointed out in [36, Remark 15] for two-subchannel 
case. 

We now consider a Gaussian example of the parallel relay 
channel with degraded subchannels. The channel input-output 



relationship at one time instant is as follows. 

For keA, Y Rk = X k + Z Rk 

Yk = X k + ^fpRk~X Rk + Z Rk 



z' k , 



(9) 

where Z Rk and Z' k are independent Gaussian random variables 
with variances a Rk and a k 



a Rk , respectively. For k S A, 



For k e A c 




k -r & Rk 

PRkX Rk + Z k , 



(10) 

where Z k and Z' Rk are independent Gaussian random variables 
with variances a\ and a Rk — a\, respectively. For k £ A c , 
<J Rk > of. In ® and ([H, p Rk (assumed to be positive) indi- 
cates the ratio of the relay-to-destination SNR to the source-to- 
destination SNR for subchannel k. We assume that the source 
and relay input sequences are subject to the following average 
power constraints: 

i=l k=l i=l k=l 

(ID 

where i is the time index. 

It can be seen from (O and dTOb that the subchannels 
with k e A satisfy the degradedness condition (0 and the 
subchannels with k E A c satisfy the degradedness condition 
©. Hence the Gaussian channel defined in © and dTOb is 
the parallel relay channel with degraded subchannels. The 
following capacity theorem is based on Theorem [2] 

Theorem 3: The capacity of the Gaussian parallel relay 
channel with degraded subchannels is given by 

C = max 

0</3fc<l, for k=l,...,K 



K 



k=l 



E< 

keA 



Pk + PRk PRk + 2 a/ fikPRkPkPRk 



'faPk 



'Rk 



keA" 



^Pk 



(12) 

where (3 k — 1 — f3 k , and the function C(x) :— ^ log(l + x). 
In ([T2b , the parameter (3 k indicates correlation between the 
source input and the relay input to subchannel k, and P k and 
P Rk indicate the source and relay powers that are allocated 
for transmission over subchannel k. 

Proof: The achievability follows from Theorem [2] by 
choosing the following joint distribution: 

X Rk ~Af(0,P Rk ), 

with X' k independent of X Rk , 



X' k ~M(0,f3kP k ), 



Xl 



IfhPk 

PRk 



X 



Rk 



X'k 



(13) 

The converse is similar to the steps in the converse proof in 
[2, Sec. IV], and is omitted. ■ 
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Note that the capacity in Theorem [3] is sometimes referred 
to as the synchronized capacity, because the source and relay 
nodes are allowed to use correlated inputs to exploit coherent 
combining gain. This may not be practical for encoder design. 
It is hence interesting to study the asynchronized capacity, 
where the source and relay nodes are assumed to use indepen- 
dent inputs. The following asynchronized capacity is derived 
by setting /3 k = 1 for k = 1, . . . , K in (fT2b . 

Corollary 1: For the Gaussian parallel relay channel with 
degraded subchannels, the asynchronized capacity is given by 



K 



c 



max 

££Ll PRk<PR 



k=l 



Pk + PRkPRk 



(14) 



c 



keA \ u Rk/ keAc 
To obtain the capacity in Theorem [3] and the asynchronized 
capacity in CorollaryQ] we still need to solve the optimization 
problems in (fT2l > and (fT4l >. i.e., to find the jointly optimal 
correlation parameters {(3k, for fc — 1,...,K} and power 
allocations {(Pk, Puk), for k — 1, . . . , K} in l fT2"l . and to find 
the optimal power allocations {(Pk, Pfik), for k = 1, . . . , K} 
in (fT4l . We study these optimization problems in the next 
section. 



III. Optimal Resource Allocation for Gaussian 
Parallel Relay Channels with Degraded 
Subchannels 

In this section, we study the optimization problems in (fT2l i 
and (fl4l >. which are max-min optimization problems. We first 
introduce a general technique for solving this class of max-min 
optimization problems. We then demonstrate the application 
of this technique by finding the optimal solutions in (TPZt 
and ( fl4] i. We obtain the analytic form of the jointly optimal 
correlation parameters {(3k, for k — 1,...,K} and power 
allocation {(Pk,PRk), for fc = l,...,K} that achieve the 
synchronized capacity for the cases where the optimization 
problem is convex. We also obtain a closed-form solution for 
the optimal {(Pk, Pfik), for k = 1, . . . , K} that achieve the 
asynchronized capacity. This optimal solution may have three 
different structures depending on the channel SNRs and power 
constraints. This optimal power allocation is directly related to 
the power allocation for the fading full-duplex relay channel 
presented in Section llVl 

A. Technique to Solve a Class of Max-Min Problem 
Consider the following max-min problem: 



max mm{Ri(t), R 2 (t)} 



(15) 



where t is a real vector in a set Q, and R\ (t) and R 2 (t.) are 
real continuous functions of t. An optimal t* is referred to as 
a max-min rule. 

We now introduce a technique to solve the max-min 
problem (flBl . We will also illustrate this technique with a 
geometric interpretation. This technique is similar to that used 




Ri(t) 



Fig. 2. Illustration of functions V(a) and R(a,t) 



in finding the minimax detection rule in the two hypothesis 
testing problem (see, e.g., [35, Sec. LLC]). 
Consider the following function: 



R(a,t) := aRi(t) + (1 - a)R 2 (t), 



< a < 1. (16) 



As a function of a, R(a,t) is a straight line from R(0,i) = 
i?2 (i) to R(l,t) = Ri(t). Hence the maximization in ( TTBT ) 
corresponds to maximizing the minimal of the two end points 
of the line R(a,t) over all possible tEQ. 
We further define a function 



V(a) := m&xR(a,t) = R(a,t {a) ), 



(17) 



where v- a ' maximizes R(a, t) for fixed a. From the definitions 
of V(a) and R(a,t), it is easy to see the following two facts 
(see Fig. |2]for an illustration): 
Fact 1: The function V(a) is continuous and convex 

for a e [0,1]; 
Fact 2: For any power allocation rule t(zQ, R(a,t) as 
a function of a is completely below the convex 
curve V(a) or tangent to it. 
A known general solution to the max-min optimization 
problem in ( TT3T > is summarized in the following proposition. 

Proposition 1: Suppose a* is a solution to V(a*) = 
min ag [ ,i] V( a )- Then t^ a ) is a max-min rule, i.e., a solution 
to the max-min problem in (fT31 l. The relationship between 
i?i(i( Q *)) and R 2 (t {a ">) falls into the following three cases 
(see Fig. [3): 
Case 1: If a* = 0, i?i(^ Q *)) > R 2 (^ a ^); 

If a* = 1, Ri(^ a ">) < i? 2 (i ( "* ) ); 
(Equalizer rule) If < a* < 1, Ri(t( a ">) = 
R2(t_ (a ' ) ). 

This technique of finding the max-min solution is applied 
throughout the paper. 

B. Optimal Resource Allocation for Gaussian Parallel Relay 
Channel: Synchronized Case 

In this subsection, we apply Proposition [T] to find jointly 
optimal {/3 k , for fc = 1,...,K} and {(P k ,P Rk ), for fc = 
1,...,K} that solve the max-min problem in ( [12] ). This 



Case 2 
Case 3 



V(a) 



i? 2 a (Q * ) ) 



a* = 



R 2 {t^) 




a* = 1 



i?i(t( Q *)) 



a* 1 

a 

Case 3 

Fig. 3. Illustration of Cases 1, 2, and 3 in Proposition [T] 



optimal solution provides the optimal correlation between 
the source and relay inputs over each subchannel and the 
optimal source and relay power allocation among the K 
subchannels that achieve the synchronized capacity of the 
Gaussian parallel relay channel with degraded subchannels. 
We study the asynchronized case in the next subsection. 

To simplify notation, we let 



P=(P 1 ,...,P K ) 



Pr 



in- 



.Prk), 



(18) 



and 



K 



K 



Q = \(P,P R ,P)- E Pfe ^ p < E p ^ p «> 

I k=l k=\ 

0< Pk <1, for k = 1,...,k\ 



(19) 



The max-min optimization problem in ( TT~2T > can be written 
in the following compact form. 



C= max minimi (P,P R ,0), H 2 (P,P R ,§)} 
where 

ni{p,p R ,p) 

_ ( Pk + PRkPRk + ^\JPkPR.kPkPRk \ 



k=l 



n 2 (P,p R ,0) = J2 c 



keA 



Pkpk 



' Rk 



keA" 



fikPk 



(20) 

According to Proposition [T] the max-min rule that solves 
(l20b may fall into the following three cases. 

Case 1: a* = 0, and (P (0) , Pj£\ /? (0) ) is a max-min rule, 
which needs to satisfy the condition 



Ri(p {0 \p ( r\p (0) ) > R2(p (0) ,p ( r ) ,^ 0) ] 



(21) 



By definition, (P (0) , P { ° ] , /? (0) ) maximizes 
R(O,P,P R ,0 = R 2 (P,P R ,§). 
It is readily seen that the following (3^ is optimal: 

I 1 ' 

I arbitrary 

With /3 (0) given in (EH), R 2 (P,P R ,§) is a function of P 
only. Moreover, it is a convex function of P. Then the Kuhn- 
Tucker condition (KKT condition) (see, e.g., [38, p. 314-315]) 
characterizes the necessary and sufficient condition that the 
optimal Pj QS> needs to satisfy. The Lagrangian is given by 



ifPf >0; 
if P< 0) = 0. 



(22) 



(23) 



keA 



°Rk 



(24) 



which implies the following KKT condition: 
dC 1 l_ 

dPk ~ 



2 In 2 



'Rk 



Pk 



A < 0, 



dC 



1 



with equality if Pk > 0, 
1 



- A < 0, 



dPk 2 In 2 cr| + P fe 

with equality if Pk > 0, 

Hence the optimal P^ ^ is given by 



if A: e A: 



if jfc e A c 



(25) 



p(o) _ 




2 



2 In 2A 



if k £ A; 
if ke A c 



(26) 
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where A is chosen to satisfy the power constraint Ylk=i ^ k — 
P. The function (■)+ is defined as 



if x > 
if x < 0. 



For case 1 to happen, (P (0) , P^ , /3 (0) ) needs to satisfy the 



condition OTT) . To characterize the least power Pr needed for 
case 1 to happen, p}^ needs to maximize Ri(P}°\ P R , P^) 
with given in d23l > and P} ^ given in i26\ , respectively. 



The optimal p}£) can be obtained by the KKT condition via 
the following Lagrangian: 

PRkPRk \ 



K ( Pf ] 




^ PRk - P *J ' (28) 



The KKT condition is given by 
dC 1 put 



<9P 



Rk 



2 In 2 n 2 

"k 



A*<0, 

PF + PRkPRk 

with equality if P Rk > 



which implies 

d(°) _ I 1 



p, 



(0) 



flfc 



2 In 2^ p^fc 



— M , for jfc = 1, . 



where /i is chosen to satisfy the power constraint Ylk=i Pnk — 
Pr, 

Note that ( f30b also follows directly from the standard water- 
filling solution if we further derive ( f28l ) in the following form: 



(29) 



,K 



(30) 



C 




(31) 



With P (0) , P^ , and /3 (0) given in (|26), ([SB, and <23k 
respectively, condition ( l2~lT becomes 




fcgj4 y kk / k£A c 

This condition is equivalent to the threshold condition Pr > 
Pr,u(P)- The threshold Pr )U (P) is a function of the source 
power constraint P, and is determined by the value of Pr that 
results in equality in (|32l . 

Therefore, if case 1 occurs, the optimal source power 
allocation P*- -* has a water-filling form, and the optimal 
relay power allocation pj R also has a water-filling form 
with P^ ' + as the equivalent noise levels. The optimal 
correlation parameter f3^ = 1 for P^ > 0, which indicates 
that coherent combining is not needed for this case. 

Case 2: a* = 1, and (P (1) , P^\f3_ {1) ) is a max-min rule, 
which needs to satisfy the condition 

Ri(P {1) ,Pr\P {1) ) < i?2(P (1) ,P^\/3 (1) ). (33) 



By definition, (P (1) , P^, (3 {1) ) maximizes 



R(l,E,P R ,l3) = Ri(P,P R ,P). 



(34) 



(27) We note that 




if Pi 15 > 0, and P% > 0; 



(i) 



otherwise. 



Rk 



(35) 

It can be shown that Ri(P_,P R , /3) is a convex function 

°f (E-iE-r) for given in ( t35l >. To derive the optimal 

(E^,^) *at maximizes Pi(P, P R ,f3_ {1) ), the Lagrangian 
can be written as 



K 



k=l 



Pk + PRkPRk + 2y/p Rk P k PRk 



K 



K 



(36) 



-A ^P fc -P - M [J2 p Rk-Pfi 



\k=l 



\k=l 



The optimal {P} 1 \p} r ) needs to satisfy the following KKT 
condition: 



dC 

dC 
dPi 



1 



'Pk + V PRk PRk 



2 In 2 



< Ax/Pfc, 



/ Pfc + V PRkPRk) 2 

with equality if P& > 0; 



!Pk + V PRkPRk 



Hi: 



2 In 2 




Pfc + V PRkPRk) 

with equality if Pr*, > 0. 

(37) 



From (f37j, it is clear that P^ 1} = p£j) = 0. 

According to ( T35l l. we have (3^ P^ = for fc = 1, . . . , K, 
which implies P 2 , P^ , ) = 0. Hence condition dHJ 
cannot be satisfied. Therefore, case 2 never happens. 

Case 3: < a* < 1, and (P (Q *\pjf _ 
min rule, where a* is determined by the following condition 



.P^,/?^) is a max- 



Ri(P (a '\P { R \§_ (a ' } ) = R2{P {a '\P { R \d [a " ] ). (38) 
We need to derive (P ( "*\P^ \§_ ( - a ^ > ) that maximizes 



(32) P(a*,P,P R ,/3)=a*P 1 (P,P R) /3) + (l 



■a*)R 2 (P,P R ,§) 
(39) 

for a fixed a*. This optimization problem is not convex. Now 
the KKT condition provides only a necessary condition that the 
optimal (P (a * ) ,P^* ) ,/3 (o! * ) ) needs to satisfy. One can still 
perform a brute force search over those (P, P_ R , (3) that satisfy 

the KKT condition to find the optimal (P {a *\ P { R * ) , §_ {a "" > )- 
However, it may be too complex to implement such an optimal 
solution that involves designing correlated source and relay 
inputs and also involves allocating the source and relay powers 
jointly with the correlation parameter for each subchannel. 
Hence it may not be worth searching for the jointly optimal 



solution (P 



-R 



), except in case 1, where using 



independent source and relay inputs is optimal and the optimal 
power allocation (P} a ^ , p} r ' ) is simpler. It is hence more in- 
teresting to study the asynchronized case, where it is assumed 
that the source and relay nodes use independent inputs. 
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C. Optimal Resource Allocation for Gaussian Parallel Relay 
Channel: Asynchronized Case 

In this subsection, we solve the max-min problem in ( TB) . 
This problem is simpler than the max-min problem in ( [T2") . 
because the optimization is over the power allocation (P, P_ R ) 
only, and does not involve the correlation parameters (3. This 
also makes the optimal solution easy to implement in practice. 
In the following, we fully characterize the optimal power 
allocation, which may take three possible structures. 

We let 



Q=\{P,Pr): Y,Pk<P, J2 P * k ^ P ") 
{ fe=i fe=i J 



(40) 



and rewrite the max-min optimization problem in ( fT4b in the 
following manner: 



C= max mm{Tli(P,P R ), K 2 {P,P R )} 
where 



K 



n 1 (p,p R ) = J2c 



Pk + PBM.Pi 



Rk 



n RkJ fceA c \ a k 



(41) 



We apply Proposition [T] to solve d4l) . and consider the 
following three cases. 

Case 1: a* =0, and {P}°\P R ) is a max-min rule, which 
needs to satisfy the condition 

Ri(P {0 \pf)>R2(P {0 \P ( x ) ). (42) 

The optimal (P (Q) ,P^) can be derived following the steps 
that are similar to those in case 1 of the synchronized case, 
and is given by 

+ 



P 



(0) 



1 



21n2A 
1 

21n2A 



'Rk 



4 



if he A 



if k e A c 



P 



(o) 



1 



P 



(0) 



2 In 2p p flfe 



for k = 1, . 



,/v 



(43) 

where A and /i are chosen to satisfy the power constraints 
Ef =1 ft < P and £f =1 ^Jtt < Pr- 

We refer to the optimal (P^ , P^. ) in (|43) as two-level 
water-filling for the following reason. The optimal P^ is first 
obtained via water-filling with respect to the noise levels a Rk 
and a\. The optimal P^ is then obtained via water-filling 



with P^ ' +0^ as equivalent noise levels, where P} v> is treated 
as an additional noise level. 

With (Pjf\P R °k) given in (03), condition (02) becomes 



(44) 




This condition is equivalent to the threshold condition P R > 
Pr,u(P), where the threshold Pr, u {P) is determined by the 
value of Pr that results in equality in (04). The threshold 
Pr,u(P) is clearly a function of the source power constraint 
P. 

Case 2: a* = 1, and {P {1) ,P ( R } ) is a max-min rule, which 
needs to satisfy the condition 



Ri(P (1 \P { R l) )<R2(P il \P$) 

By definition, (P (1) ,P^ 1) ) maximizes 

R(1,P,Pm) = Ri(P,Pr)- 



(45) 



(46) 



We first note that Pi (P, P_ R ) is a convex function of (P, P fl ) 
The Lagrangian can be written as 



K 



k=l 



Pfe + Pi?fcP 



A" 



-a 



\fc=l 



^Pfl/b-pfl) 

vfc=l / 



(47) 



According to the KKT condition, {P^\ P} R ) needs to satisfy 
dC 1 1 



8P k 2 In 2 al + Pk+pBkP, 



< A, 



with equality if Pfe > 0; 



<9£ 



1 



< 



(48) 



dPRk 2 In 2 + Pa- + pRkPRk PRk ' 
with equality if P R k > 



which implies 



If A < 
If A > 



JL 

PRk 



Pk = 



1 



21n2A 



Pm = 0, 



P fc = 0, Prm = f-i— - 
Pflfe \ 2 In 2p ph/c / 



If A - , Pfc + PRkPRk = I . - "/, 

Pflfe \21n2A 



1 



(49) 



where A and [i are chosen to satisfy the power constraints. In 
general, A ^ . The equation (|49) implies an orthogonal di- 
vision water-filling power allocation, i.e., for each subchannel, 
either the source node or the relay node allocates a positive 
amount of power. This power allocation is similar to the 
optimal power allocation for fading multiple access channels 
[39]. 

For case 2 to happen, (P} 1 ^, p} r ) needs to satisfy the 
condition (143) . i.e., 



K /p (1) + » P (1) 

^ c / Pk + PRkPRk 



fc=l 



< 




(1)' 



(50) 



keA 



k£A c 



This condition essentially requires that the relay power P R is 
small compared to the source power P. 
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Case 3: < a* < 1, and (P (a *\P { f } ) is a max-min rule, 



where A is chosen to satisfy the power constraint J^is-Li Pk ^ 
P. For a given P, the value of P fl can be obtained by using 
\, and its components have the following form: 



where a* is determined by the condition 

PiGP^P^) = R2{P {a '\P { R ] ) 



(51) 



We first derive (P} a ',P} R ') that maximizes 



R(a*,P,P R ) = a*Pi(P,P R ) + (1 - a*)P 2 (P, P fl ) 



for a given a*, and a* will be determined later. 
The Lagrangian can be written as 



/ Pfe + PRkPRk 



(l-a*)£C 



P fc 



v.fe=l / 



which implies the following KKT condition: 

a* 1 



For k £ A, — 

OPk 



For fc € A c 



dC 

dP k 



2 In 2 a 2 k + P k + p Rk P Rk 

1 - a* 1 
+ 21n2 ' <J Rk +P k " ' 
with equality if P k > 0; 

1 



(54) 



or 



2 In 2 <jI + P k + p Rk P Rk 
I -a* 1 

< A, 



(55) 



For fc = 1, 



,K, 



21n2 d^ + Pfc 
with equality if P k > 0; 



d£ _ a* _ 
dP Rk ~ 21n2 ' ^| + P fc + p^Pflfc 
with equality if Pr^ > 0. 



< 



(56) 



The optimal (P (Q *\pjf } ) can be solved by an iterative 
algorithm. For a given P_ R , the value of P can be obtained by 
solving d54l and (1551 . and its components have the following 
form: 



P- 



positive root x of (T58b if it exists, otherwise 0, 

if k e A; 

positive root x of d59l > if it exists, otherwise 0, 

if k e A c 



(57) 

where the roots are determined by the following equations: 

a* 1 1 -a* 1 _ 

21n2 ' x + p Rk P Rk + a\ + 21n2 ' x + a\ k ~ 



1 



1 - a* 



1 



21n2 x + p Rk P Rk + at 21n2 x + ai 



= A (59) 



Pfifc 



a' 



2 In 2fi 



PRk 



for fc = 1, 



,K 
(60) 

(52) where /i is chosen to satisfy the power constraint J\ =1 Pra: < 
Pit. 

If we iteratively obtain P and P fl according to d57l i and ( f60b 
with an initial P fl , we show in the following that (P_,P_ R ) 
converges to an optimal (P} a \P^ R )• We refer to this 
optimal power allocation as the iterative water-filling power 
■ l;i i allocation. We finally need to search over < a < 1 to find 
a* that satisfies the equalizer condition d5TT) . 

Proof of Convergenece: We show that (P_,P_ R ) obtained 
iteratively according to (l57l l and d60l l converges to an optimal 
(J^ a *\P£ )). We first note that after each iteration the 



(53) 



objective function (1521 either increases or remains the same. 
We also note that the objective function is bounded from the 
above because of the power constraints at the source and relay 
nodes. Hence the objective function must converge. It is easy 
to check that for a given P, the objective function is a strictly 
concave function of P_ R , and d60b yields the unique optimal 
P_ R . It is also true that for a fixed P_ R , d57b yields the unique 
optimal P. Hence as the objective function converges, (P, P_ R ) 
must converge. Moreover, (P_,P_ R ) converges to the solution 
of the KKT conditions, which are sufficient for (P_,P_ R ) to 
be optimal because the objective function is concave over 

(P,P R )e Q- ■ 

We now summarize the optimal power allocation that solves 
fill in the following theorem. 

Theorem 4: The optimal solution to fill , i.e., the optimal 
power allocation that achieves the asynchronized capacity ( TT4l 
falls into the following three cases: 

Case 1: The optimal (P_,P_ R ) takes the two-level water- 
filling form and is given by (03). This case happens if 
P R > Pr iM (P) where the threshold Pr jU (P) is determined 
by equality of fi4l . 

Case 2: The optimal (P_,P_ R ) takes the orthogonal division 
water-filling form and is given by t49\ . This case happens if 
condition (f50b is satisfied. 

Case 3: The optimal (P,P R ) takes the iterative water- 
filling form and is obtained iteratively by d57l > and (160) . 

IV. Fading Full-Duplex Relay Channels 

In this section, we study the three-terminal relay channel [1], 
[2] in the context of wireless networks, where nodes commu- 
nicate over time-varying wireless channels. We are interested 
in how the source and relay nodes should dynamically change 
their power with wireless channel variation to achieve optimal 
performance. Such wireless relay channels can be modelled by 
the fading full-duplex relay model, where each transmission 
link of a three-terminal relay channel [1], [2] is corrupted 
by a multiplicative fading gain coefficient in addition to an 
additive white Gaussian noise term (see Fig. 0J. The fading 
relay channel is referred to as the full-duplex channel because 
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the relay node is allowed to transmit and receive at the same 
time and in the same frequency band. 



Yr '■ Xr 



a set A :— {h : p 3 \h 3 \ 2 > pi\h\\ 2 }, which contains all the 
fading states h with the source-to-relay link being better than 
the source-to-destination link. The complement of the set A 
is A c := {h : p 3 \h 3 \ 2 < pi\hi\ 2 }. We define a set Q that 
contains all power allocation functions that satisfy the power 
constraints, i.e., 



Sine X 



Y Dssfatn 



Fig. 4. Fading full-duplex relay channel 

The channel input-output relationship at each symbol time 
can be written as 



Y = yfplhiX + ^h 2 X R + Z, 
Yr = Jpih 3 X + Z R , 



(61) 



where hi, h%, and h 3 are fading gain coefficients corre- 
sponding to the three transmission links, respectively, and are 
assumed to be independent complex proper random variables 
(not necessarily Gaussian) with variances normalized to 1 . We 
further assume that the fading processes {hu}, {h 2 i}, and 
{h 3 i} are stationary and ergodic over time, where i is the 
time index. In (loTT i. the additive noise terms Z and Zr are 
independent proper complex Gaussian random variables with 
variances also normalized to 1. The parameters pi, p 2 , and p 3 
represent the link gain to noise ratios of the corresponding 
transmission links. The input symbol sequences {XA- and 
{Xm} are subject to separate average power constraints P 
and Pr, respectively, i.e., 

n 1 n 

-Ve|X,| 2 <P, _VE|X K | 2 <P r . (62) 

n ' n ^— ' 

i—l i—l 

Remark 4: The fading relay channel is a special case of 
the parallel relay channel with each subchannel corresponding 
to one fading state realization. In particular, for a given fading 
state the fading relay channel is a Gaussian relay channel by 
( |6Tb . However, since this Gaussian channel is not physically 
degraded, the fading relay channel is not a Gaussian parallel 
relay channel with degraded subchannels that is considered in 
Sections [II] and [Hi] where physically degradedness is assumed 
for each subchannel. 

We assume that the transmitter and the receiver know the 
channel state information instantly. Hence the source and relay 
nodes can allocate their transmitted signal powers according to 
the channel state information to achieve the best performance. 
Our goal is to study the optimal power allocation at the 
source and relay nodes. As in Section ITlI-CI we are interested 
in the asynchronized case for the fading full-duplex relay 
channel, where the source and relay nodes are required to use 
independent inputs. The main reason is because this simplifies 
the transmitter design, and is more practical in distributed 
networks, where nodes need to construct their codebooks 
independently. 

For notational convenience, we collect the fading coeffi- 
cients hi, hi and h 3 in a vector h :— (hi, h%, hs). We define 



Q = {(P(h), P R (h)) : E[P(h)] < P, E[P R (h)] < Pr} . 

(63) 

The following lower and upper bounds on the asynchronized 
capacity of the fading full-duplex relay channel were given in 
[9]. 

Lemma 1: ([9]) For the fading full-duplex relay channel, 
lower and upper bounds on the asynchronized capacity are 
given by 



Clo 



(P(h),P a (h))eg 



min < 2E 



c(P(h) P i\hi\ 2 + P R (h)p 2 \h 2 \ 2 ) 



2E A 



'(P(h)p 3 \h 3 > 2 



2E A c 



'(P(h)pi\hi 



(64) 



C up = max 

{P(h),P R (h))eG 



min < 2E 



c(p(h) Pl \hi\ 2 + P R {h) P2 \h 2 \ 



2E 



:(p(h)( P i\hi 



2 +p 3 N 2 : 



(65) 

Note that the rates in the lower bound of Lemma Q] are the 
same as the achievable rates in Corollary Q] 

The optimal power allocation that maximizes the lower 
bound d64i i and the upper bound (l65l l were obtained in [9] 
under a sum power constraint, i.e., the source and relay nodes 
are subject to a total power constraint. In this paper, we assume 
that the source and relay nodes are subject to the separate 
power constraints as given in (l62l and d63l . and derive the 
optimal power allocations that maximize the bounds (f64l > and 
d65t . respectively. We also characterize the conditions where 
the lower and upper bounds match and determine the capacity 
of the channel. 

Using the same technique as in Section IIII-CI we charac- 
terize the optimal power allocation that maximizes the lower 
bound (l64l of the fading relay channel. This optimal power 
allocation takes the same three structures as those given in 
Section ITlI-CI and is summarized in the following for the sake 
of completeness. 

Optimal power allocation that maximizes the 
lower bound ( f64b : 

Case 1 (two-level water-filling): If P R > Pr, u (P), the 
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optimal (P m (h),P R 0) (h)) is given by 
1 1 



P (0) (h) = { 



Aln2 p 3 \h 3 \ 2 
1 1 



, V A In 2 pi\h\ 



if h £ A, 



if h G A c 



(66) 

where A is chosen to satisfy the power constraint 
E[P(h)] = P. 



p R 0) ® 



pin 2 



l+Pi|foi| 2 P (0) (fr) 
p 2 \h 2 \ 2 



(67) 



where p is chosen to satisfy the power constraint 
E[Pfl(fc)] = ^k- 

The threshold Pr. u (P) as a function of the source power 
P can be solved using the following equation 

E [c (p (0) (h) Pl \hi\ 2 + p£°> (h) P2 \ha I 2 ) ] 

= E A [c(p (0) (ft)p3|fta | 2 )] (68) 

+ E A o [c(p (0) (/i)pi|/ t i 



Case 2 (orthogonal division water-filling): The optimal 

(P W (h),P^{h)) is given by 



If 



If 



A 



< 



/' 



Pi\hi\ 2 p2\h 2 \ 2 
P w (h)=' ' 
A 



> 



Aln2 pi|/ii| 2 



P ( R\h)=0; 



Pi\hi\ 2 p 2 \h 2 \ 2 ' 
P w (h)=0, pW(h) 



pln2 p 2 \h 2 \ 2 



(69) 

where A and p are chosen to satisfy the power constraints 

E[P(h)] = P and E[P R (h)] = Pr. 

Case 2 happens if the following condition is satisfied: 

E [c(p (1) (ft)pi|fe 1 | 2 + Pi 1) (/i)p2|fe| 2 )] , 

<E A [c(p (1) (h) p 3 \h 3 1 2 )] (70) 



+ Ea< 



[c(pW(ft)pi|hi| ! 



Case 3 (iterative water-filling): The optimal 
(P^ a ' , P^ a ' ) can be obtained by the following 
iterative algorithm. For a given P R {h), the value of P(h) 
is given by 

positive root x of ( |72| ( if it exists, otherwise 0, 

if he A; 

positive root x of ( 173b if it exists, otherwise 0, 

if h£A c 
(71) 

where the roots are determined by the following equations: 



P{h) = { 



Pi\h\ 



In 2 pi|/ii| 2 z + Pfl(A)p2|/i2| 2 + l 
1 - a* palfa] 2 
In 2 ' l + p 3 |/i3| 2 a; 

a* pi|fei| 2 

In 2 ' pi|/n| 2 :r + Pii(A)p2|/i2| 2 + l 

,1-a* pij/iil 2 



(72) 



(73) 



In 2 l + pi|/n| 2 a; 



= A 



where A is chosen to satisfy the power constraint 
E [PQk)] = P. For a given P(h), the value of P R (h) 
is given by 

Pn(h)=(^- 1 + P ^i m ) + (74) 



pln2 



p2 | | 



where p is chosen to satisfy the power constraint 

E[Pi?(/i)] = Pr. 

The power allocation (P(fi), PrQi)) obtained iteratively 
from ( 1711 ) and d74b with an initial PrQi) converges to 
an optimal (P^ a '(A))-Pr* ©)• The parameter a* is 
determined by the following equalizer condition: 

E [c^POilp!^ 2 + P R (h)p2\h 2 \ 2 )] 

= E A [c(p(ft) P3 |/i3| 2 )] (75) 

+ E A c [c(p{h) P i |/ii| 2 ^ 



The optimal power allocation for the upper bound ( I65l l can 
be derived in a similar fashion but it is omitted here since this 
optimization does not have an operation meaning. In general, 
the upper and lower bounds do not match. In the following 
theorem, we characterize the condition where the two bounds 
match and establish the asynchronized capacity. 

Theorem 5: For the fading full-duplex relay channel, if the 
channel statistics and the power constraints at the source and 
relay nodes satisfy the condition d70b , then the asynchronized 
capacity is given by 



C = 2E 



C{P^{h) Pl \hl\ 2 +P^{h) P 2\h2 



(76) 



where the capacity achieving power allocation 
(P( x ) (h) , (K)) takes the orthogonal (time) division 



water-filling form given in 

Proof: The lower bound d64l i and the upper bound 
have one term in common inside the "min" in their expression. 
If condition (l70l ) is satisfied, case 2 happens when solving 
the max-min problem for the lower bound d64b . In this case, 
the common term of the bounds is optimized by the power 
allocation in (|69l l and determines both bounds that result in 
C* U p = Ciow This common value is thus the asynchronized 
capacity. ■ 
The condition given in Theorem [5] essentially requires that 
the relay power Pr be small compared to the source power 
P. In this case, the optimal scheme is to maximize the rate 
at which the source and relay nodes can transmit to the 
destination node. The optimal scheme is to let the source and 
relay nodes have a time division access of the channel. For a 
given channel state realization, the node with a better channel 
to the destination node is allowed to transmit. This is similar 
to the optimal power allocation scheme for the fading multiple 
access channel studied in [39]. 

V. Fading Half-Duplex Relay Channels 

In this section, we study a fading half-duplex relay channel 
model, where the source node transmits to the relay and 
destination nodes in one channel (channel 1), and the relay 
node transmits to the destination node in an orthogonal channel 
(channel 2). We introduce a parameter 9 to represent the 
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channel resource (time and bandwidth) allocation between the 
two orthogonal channels. We draw this fading half-duplex 
relay channel model in Fig. [5] with the solid and dashed 
lines indicating the transmission links of channels 1 and 2, 
respectively. 



Rfy 



Yr ■ X R 



Sine X 



Dshtn 



Fig. 5. Fading half-duplex relay model 

The input-output relationship for the fading half-duplex 
relay channel is given by 



Yi = y/pihiX + Zi, 
Y 2 = y/p^h 2 X R + Z 2 , 
Yr = yfpih 3 X + Z R , 



(77) 



where hi, h%, and h 3 are fading gain coefficients that satisfy 
the same assumptions as for the fading full-duplex relay 
channel in Section [IV] The additive noise terms Z\, Z 2 , 
and Zr are independent proper complex Gaussian random 
variables with variances normalized to 1. The parameters pi, 
pi, and p3 represent the link gain to noise ratios of the 
corresponding transmission links. The source and relay input 
sequences are subject to the same power constraints (l62l as 
in the fading full-duplex relay channel. 

As in the full-duplex case, the channel state information is 
assumed to be known at both the transmitter and the receiver. 
Hence the source and relay nodes can allocate their powers 
adaptively according to the instantaneous channel state in- 
formation. The half-duplex channel has an additional channel 
resource allocation parameter 9 that may also be optimized. 
Our goal is to find the jointly optimal 9 and power allocation 
for the source and relay nodes that achieve the best rate. We 
also derive an upper bound on the capacity, which helps to 
establish capacity theorems for some special cases. 

We study three scenarios. In Scenario I, we fix 6 — |, 
and only consider the maximization of the achievable rate 
over the power allocation at the source and relay nodes. In 
Scenario II, we restrict 9 to be same for all channel states, and 
jointly optimize the achievable rate over this single parameter 
9 and power allocation. In Scenario EH, which is the most 
general scenario, we further allow 9 to change with channel 
state realizations, and optimize the achievable rate over all 
possible channel resource and power allocations. 

A. Scenario I: Fixed 9=1/2 

In this subsection, we study Scenario I, where the two 
orthogonal channels share the channel resource equally, i.e., 
the channel resource allocation parameter 9 = 1/2. We use 
this scenario to demonstrate the three basic structures of 



the optimal power allocation, which take simple forms. The 
optimal power allocation can be implemented in a distributed 
manner at the source and relay nodes, because each node needs 
to know only the channel state information of the links over 
which it transmits. 

In the following, we first give an achievable rate for this 
channel, and then find an optimal power allocation that max- 
imizes this achievable rate. 

Proposition 2: An achievable rate for the fading half- 
duplex relay channel Scenario I is given by 

Clou = max 



mirJE C[2P(h)pi\hi\ 2 ) + C( 2P R (h)p 2 \h 



c(2P(h)p 3 \h 3 



c(2P(h)pi\hi 



(78) 

Proposition |2] follows easily by using steps that are similar 
to the achievability proof for Theorems |2] and [3] and by using 
the channel definition f77t . 

The optimal power allocation that maximizes Ci ow in (l78l ) 
can be derived by applying Proposition [T] and are given in the 
following three cases. The details of the proof are relegated 
to Appendix U 

Optimal power allocation that maximizes the 
lower bound d78l i: 

Case 1: If P R > Pr, u (P), the optimal 
(P (0) (h),P R 0) (h)) is given by 



if h£A, 



p(V(h) = f 2 V Aln2 Ps\h3\ 



2 \Aln2 pi\hi\ 



if h £ A c 
(79) 



where A is chosen to satisfy the power constraint 
E[P(h)] = P. 



2 V^ln2 p2\h 2 f 



(80) 



where n is chosen to satisfy the power constraint 

E[Pfl(h)] = Pr. 

The threshold Pr, u (P) as a function of the source power 
P can be solved using the following equation: 



^\c{2P^ ) {h) P2 \h 2 \ 

= E A [c(2P m (h)p 3 \h 3 \ 2 ^) -c(2P (0) aO Pl |/ tl | 2 )] . 

(81) 



Case 2: If Pr < Pr,i(P), the optimal (P m (h), P^ih)) 
is given by 



P m (h) 
P?® 



1 



2 VAln2 pi|fti| 2 
1/1 1 



2 \/xln2 p 2 \h2 



(82) 



(83) 



where A and p are chosen to satisfy the power constraints 
E[P(h)] = P and E[P R (h)] = Pr. 



13 



The threshold Pr,;(P) can be solved using the following 
equation: 

E[c(2P£\h)p 2 \h 2 \ 2 )] 

= E A [c(2P w (h) P3 \h 3 \ 2 ^ -c(2P«(ft)p 1 |/i 1 | 2 )] . 

(84) 

Case 3: If Pri(P) < Pr < PrA p )> the optimal 
{P {a '\h),P R ai) (h)) is given by 

P^'Hh) = 

'positive root x of iS6\ if it exists, otherwise 0, 



1 



I 2 \Aln2 pi|/ii| 2 



if hE A; 
if h e A c 



(85) 



where the root x is determined by the following equation 

a* 1 1 — a* 1 

+ 



21n2 - 1 , + g; 21n2 - 1 ., + x 



P ( R*\h) 



1 / a* 



2 V/"ln2 p 2 |/ia| 



— A = 0. 
(86) 
(87) 



The parameters A and p, are chosen to satisfy the power 
constraints given in d63b - The parameter a* is determined 
by the following condition: 



E [c(2P (Q * ) (A)pi|^i| 2 ) +c(2P R a * ) (h)p 2 \h 2 \ 2 )} 
= E A [c(2P^'\h) P3 \h 3 \ 2 ^ 
+ E A c [c(2P^(h) Pl \hi 



(88) 



It can be seen that in all cases the optimal power allocation 
P R (h) for the relay node depends only on the fading gain hi 
of the relay-to-destination link and it is always a water-filling 
solution. However, the optimal power allocation P(h) for the 
source node in general depends on the fading gains hi and h% 
corresponding to two links (source-to-destination and source- 
to-relay), and it is not a water-filling solution in general. Only 
in cases where Pr is large or small compared to P, i.e., 
where P R > P R<U (P) or P R < P R ,i(P), the optimal P(h) 
depends only on the fading gain of one link and it reduces 
to a water-filling solution. This is intuitive because when P R 
is small compared to P, we should make the multiple access 
transmission from the source and relay nodes to the destination 
node as strong as possible, and hence the power allocation at 
the source node should be based on the fading gain hi of the 
source-to-destination link. When Pr is large compared to P, 
we should transmit as much information as possible from the 
source node to the relay node, and hence the power allocation 
at the source node should be based on the fading gain /13 of 
the source-to-relay link. 

We now provide numerical results for a Rayleigh fading 
half-duplex relay channel. We assume that the fading co- 
efficients h\, hi and are independent, zero-mean, unit 
variance, proper complex Gaussian random variables (i.e., the 
amplitudes \h\\, I/12I and \hs\ have a Rayleigh distribution). 
We further assume p\ = 0.1, P2 = 0.1, and p% = 1. We 
assume the power constraint at the source node is P = 3 



dB. This corresponds to the practical environment where the 
relay node is close to the source node. In Fig. [6] we plot the 
achievable rates for Scenario I optimized over power allocation 
(PQk)' PrQl))- We also indicate the corresponding max-min 
optimization cases to achieve these rates. It can be seen that the 
achievable rate increases as the relay power increases in cases 
2 and 3, and saturates when the relay power falls into case 
1. This is because in case 1 the relay power is large enough 
to forward all the information decoded at the relay node to 
the destination node, and the achievable rate is limited by the 
capacity of the source-to-relay link. 




5 10 
Relay Power P (dB) 



Fig. 6. Optimal achievable rates in Scenario I 




Case 3 
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- 



Case 2 



4 6 8 10 

Source Power P (dB) 



Fig. 7. Ranges of source and relay powers with corresponding max-min 
optimization cases in Scenario I 



In Fig. [7] we plot the ranges of the source and relay powers 
with their corresponding max-min optimization cases. The 
solid line in the graph divides cases 1 and 3, and corresponds 
to the threshold function P R , U (P)- The dashed line divides 
cases 2 and 3, and corresponds to the threshold function 
P R i(P). It is clear from the graph that when the relay power 
is small compared to the source power, the optimal power 
allocation falls into case 2, and when the relay power is large 
compared to the source power, the optimal power allocation 
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falls into case 1. Since the achievable rate (based on the 
decode-and-forward scheme) saturates in case 1, it is not 
useful to increase the relay power beyond the solid line in 
Fig. [7] if the decode-and-forward scheme is adopted. Hence 
the solid line Pr. u defines the relay powers that provide 
the best decode-and-forward rates under Scenario I for the 
corresponding source powers. 

B. Scenario II: Same 9 for All Channel States 

In Scenario I, 9 is fixed at 1/2; i.e., the channel resource of 
time and bandwidth is equally allocated for the two orthogonal 
channels. Such equal channel resource allocation may not be 
optimal, and therefore we consider Scenario II, where the 
channel resource allocation parameter 9 needs to be optimized 
jointly with power allocation. We also assume that 9 is the 
same for all channel states to make the system design simple. 
As in Scenario I, the optimal solution of Scenario II can also be 
implemented in a distributed manner at the source and relay 
nodes. This is because the optimal 9 depends only on the 
channel statistics, not on the channel state realizations. The 
power allocation at each node depends only on the channel 
state of the links over which the node transmits. 

We first give an achievable rate (lower bound on the 
capacity) and a cut-set upper bound on the capacity. We then 
study the joint channel resource and power allocations that 
optimize these bounds. We also characterize the condition 
when the two bounds match and establish the capacity. 

Proposition 3: An achievable rate for the fading half- 
duplex relay channel scenario II is given by 



Ciow = max 

o<9<i,(P(h),p R (h))eg 



min < E 



29C 



[ P(h)pi\hi 

V 



29c(^fM 



29C 



fP(h)p 3 \h 3 \ 2 



29C 



(89) 



where 9 = 1 — 9. An upper bound on the capacity is given by 



a 



up 



max 



min < E 



E 



0<S<l,(P(h),P R {h))Gg 

il 



29C 



29C 



P{h) Pl \h-^ 



29C 



PR{h) P i\hi\ 



mzo(P3ifr3i 2 +PiM 2 ) 



(90) 

We provide the optimal channel resource and power allo- 
cations (9, P(h), Pr(/i)) that solve dHJl in the following. The 
proof of optimality is relegated to Appendix UTI 

Optimal resource allocation that maximizes the 
lower bound 



Case 1: This case is included in case 3 with the parameter 
a being allowed to take the value of 0. 
Case 2: The optimal (9 W , P w (h), P^(h)) can be 
obtained by the following iterative algorithm. For a given 



9, the power allocation (P(h), Pit.(h)) are given by 
1 1 



P{h) = 9 
Pn(h) = 9 



A In 2 pi|/ii| 2 
1 1 



pln2 p 2 \h 2 \ 2 



(91) 



(92) 



where A and p are chosen to satisfy the power constraints. 
For a given (P(h), Pr(!i)), the value of 9 is given by the 
root of the following equation: 



2E 



C 



P(h)pi\hi 



C 



P R (h)p 2 \h 2 



In 2 



P(fe)pi|fei| 2 
e + P(h)pi\hi 



P R (h)p 2 \h 2 \ 
'' + P a (h)pa|Aa| s 



(93) 

The resource allocation (9, P (h), Pr(Ii)) obtained itera- 
tively from J91t.d92b. and d93t converges to the optimal 

(0W,pW(ft),?f(ft)). 

This case happens if the following condition is satisfied 



c 



P { R\h) P 2\h 2 \ 



em 



< (1) E A 



c 

-c 



P (1 \k)pz\h 3 \ 2 
0(1) 

P {1) {h) Pl \h-_ 
6>(i) 



(94) 



Case 3: The optimal {e {a *\ P {a "\K), P$ * } (h)) can be 
obtained by the following iterative algorithm. For a given 
9, the power allocation (P(fi), Pn(h)) is given by 



P{h) = { 



positive root x of l !96l l if it exists, otherwise 0, 

if he A, 

1 1 



A In 2 pi\hx 



if he A c ; 
(95) 

where the root x is determined by the following equation: 



a o 
\n2 



+ x 



+ 



(!-<*') 
In 2 



1 



p R (h) = s 



1 



+ x 



pln2 p 2 \h 2 \ 



A = 0. 
(96) 
(97) 



The parameters A and p, are chosen to satisfy the power 
constraints. 

For a given {P{h), P R (h)), the value of 9 is the root of 
the following equation: 



2a*F,A 



\r ( -p(A)pi|fcil 2 A] _ alp , [ p(fe)pi|/ii|- 



+ 2E^[C(^1^)]__1_ EAC 
+ 2(l-a*)EA 



r p(h)pi\hi\ 2 

Le+p(h) P1 |/nl 2 



l-a* 



P(h) P3 \h 3 \ J 
8+PQt)P3\h3\' 



(98) 

The resource allocation (9, P(h), Pr(!±)) obtained itera- 
tively from l|95t.(l97|>, and d98t converges to the optimal 

(e( a *\p( a *\h)^Pg'* ) (}i)). 
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Finally, the parameter a* is determined by the condition 

'P C< °fe)piM 



E 



+ 



>(0 



(h)p2\h 2 



(ft)p3|fta| 



+ Ea- 



(99) 



The optimization for the upper bound ( |90l > can be performed 
in a similar manner, and is not presented in this paper. In 
general, the lower bound ( |89l and the upper bound d90t do 
not match. In the following theorem, we characterize the 
condition under which the two bounds match and hence yield 
the capacity of this channel. 

Theorem 6: For the fading half-duplex relay channel Sce- 
nario II, if the channel statistics and the power constraints 
satisfy the condition d94l ), then the capacity is given by 



C = E 



29 (1) C 



pW(ft)pi|/ii 



+ 



29 {1) C 



P { i ] {h) P2 \h 2 \ 



(100) 



achieving resource allocation 
can be obtained iteratively from 



where the capacity 

(eu,pW(h),pg\h)) 

(ED, <92) and @. 

The proof of Theorem [6] is similar to the proof of Theorem 
[5] and is hence omitted. 

Remark 5: The capacity in Theorem [6] refers to the largest 
rate under Scenario II that can be achieved over all possible 
channel resource allocation parameters 8 and over all possible 
power allocation rules (P(h), Pr(Ji)). 

The condition given in Theorem|6]tends to be satisfied either 
when the relay power Pr is small compared to the source 
power P, or when the relay is much closer to the source than 
to the destination. 

In Fig. [8] we plot the lower and upper bounds on the capac- 
ity of Scenario II for the same Rayleigh fading relay channel as 
in Fig. [6] Both bounds are optimized over (9, P(h), PrQi))- It 
can be seen from Fig. |8]that when the relay power is less than 
a threshold (4 dB), the two bounds match and determine the 
capacity of Scenario II. This demonstrates our capacity result 
in Theorem [6] and the condition when the lower and upper 
bounds match. Fig. [H] also shows that the gap between the 
lower and upper bounds is small even when the relay power 
is large. 

In Fig. [9] we plot the ranges of the source and relay powers 
with their corresponding max-min optimization cases. The 
dashed line in the graph divides cases 2 and 3. Similar to 
Fig. [7] the optimal power allocation falls into case 2 when the 
relay power is small compared to the source power. However, 
we see that Fig. [9] deviates from Fig. [7] in that case 1 (where 
the achievable rate saturates) is missing in Scenario II. This 
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Fig. 8. Lower and upper bounds on capacity of Scenario II 
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Fig. 9. Ranges of source and relay powers with corresponding max-min 



optimization cases in Scenario II 
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Fig. 10. Optimal 8 as a function of relay power in Scenario II 
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explains why the achievable rate under Scenario II continues 
to increase beyond the point where the rate under Scenario I 
saturates (see Fig. QT|in Section [V-Ct . 

In Fig. QJJ we plot the optimal value of 9 as a function 
of the relay power, and observe that it is not a monotonic 
function. When the relay power is small, as the relay power 
increases, the optimal 9 decreases so that more of the channel 
resource is assigned to the relay-to-destination link to make 
more use of the relay node. When the relay power is large, 
as the relay power increases, the optimal 9 increases. This is 
because the relay power is now large enough to forward all the 
information decoded at the relay node to the destination node 
even with a small amount of the channel resource, and hence 
more of the channel resource is needed for the source node 
to transmit more information to the relay node. This behavior 
of the optimal 9 is similar to that of the Gaussian half-duplex 
relay channel studied in [37]. 

C. Scenario III: 9 Changes with Channel States 

In Scenario II, the parameter 9 is required to be the same 
for all channel states, and only the power allocations are 
dynamically adjusted according to the instantaneous channel 
state. In this subsection, we study Scenario III, where 9 is 
also allowed to change with the channel state realizations, and 
9(h) is optimized jointly with power allocation (P(h), Pr(Ii)). 
However, for the source and relay nodes to decide 9(h) for 
each channel state, each node needs to know the channel 
realizations on all transmission links. This makes the system 
design more complex, and not as practical as Scenario II. We 
include the analysis of the resource allocation for this scenario 
mainly for the sake of completeness. 

Proposition 4: An achievable rate for the fading half- 
duplex relay channel Scenario III is given by: 

Ciow = max min 

O<0(h)<l, 

(p(h),p R (h))eg 



E 



E. 



29(h)C ( 



P(h)pi\h! 
6(h) 



E 



29(h)C 



)P2\h 2 \' 



8(h) 



29(h)C( P{ ^^ 



E 4 e 



(101) 



An upper bound on the capacity is given by 
C up = max min 

0<9(h)<l, 

(p(h),p R (h))eg 



Optimal resource allocation that maximizes the 
lower bound ( 11011 ): 

Case 1: The optimal resource allocation 
(8 l °Hh),P (0 \h),P R 0) (h)) is given by 



P (0) (h) 



pgHk) 



Aln2 psl^l 2 , 
1 1 




if he A, 

if he A c ; 
(103) 

if P m (h) =0; 

if P m (h) > 0, 
(104) 

where A and p are chosen to satisfy the power constraint 
given in ((63). 



(A) 



if P (0) (h) > 0; 
if P m (h) =0. 



(105) 



For case 1 to happen, (8 m (h), P (0) (h), P R 0) (h)) needs 
to satisfy the following condition: 

v[c(p^(h) P2 \h 2 \ 2 )] 

> E A [C (P <0) (h) P3 \h 3 \ 2 ) -C (P (0) (/i)piN 2 )] . 

(106) 

Case 2: The optimal (8 W (h) , P (1) (h) , p£° (h)) can be 
determined by the following iterative algorithm. For a 
given 8(h), the power allocation (P (h), PrQi)) is given 
by 

P ^- e ^(xL-^T (10?) 
1 1 



P R (h) = 9(h) 



filn2 p2\h 2 \ 



(108) 



where A and [i are chosen to satisfy the power constraints 
given in {63} . For a given (P(h), Pr(Ii)), the channel 
resource allocation 8(h) is the root of the following 
equation: 



2C 



P(h)pi\hi 



1 



Kh) 

P(h)pi\h! 



2C 



( PR(h)p 2 \h 2 \ 1 



V o(h) 



+ 



P R (h) P2 \h 2 \ 7 



In 2 8(h) + P(h)pi\hi\ 2 In 2 8(h) + P R (h)p 2 \h 2 \ 2 
= 0. 

(109) 

For case 2 to happen, (8 W (h) , P (1) (h) , P R 1} (h)) needs 
to satisfy the following condition: 



E 



E 



2 e(h)C ( p(i0 fl ff l|S 



E 



20(h)C ( Pa( -^ h21 ' 



29(h)C ( 



P(A)(p 3 |M 2 +Pi|/H| 2 



(102) 

The optimal resource allocation (9(h), P(h), PrQi)) that 
achieves the maximum of the lower bound (11011 ) is given in 
the following. The proof of optimality is relegated to Appendix 
ED 



8 (1) (h)C 



pk\k)P2\h 2 y 

8W(h) 
P W (h) P3 \h 3 



9 (1) (h)C 
-8 w (h)C 



8W(h) 
P (1) (h)p 1 \h 



8W(h) 



(110) 



Case 3: The optimal (8 {a " \h) , P (a ' \h) , P^* } (h)) can 
be obtained by the following iterative algorithm. For a 
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given 8(h), the power allocation (P(h), Pn(h)) is given 
by 



pQl) 



'positive root x of J 1 12t > if it exists, otherwise 0, 

if he A, 

i-hj { t^-t -J-^l , n /i- .r 



1 



Aln2 pi|/ii| 2 



(111) 



where the root x is determined by the following equation: 



a* 6(h) 



1 



In 2 , 



In 2 



6(h) 
P3 \ h 3\ 



P R (h) = 0(h) 



M ln2 p 2 |/i 2 | 2 



—A = 0. 
(112) 
(113) 



where the parameters A and p are chosen to satisfy the 
power constraints < |63 b - 

For a given (P(h), Pr(Il)), the channel resource alloca- 
tion 0(h) is determined by 

If h£ A, 

fP(h)pi\hi\ 2 \ a P(/i)pi|fti| 2 



2a*C 



V o(h) 



In 2 6(h) + P(h)p 1 \h 1 



2a * c [ PR(h)p2\h 2 \ 2 \ a* P R (h) P2 \h 2 \ 2 

\ S(h) J In 2 6(h) + P R (h)p 2 \h 2 \ 2 



- 2(1 - a*)C 
1 — a* 



P(h)p 3 \h 3 
6(h) 
P(h)pz\h 3 \ 2 



In 2 6(h) + P(h) P3 \h 3 \ 2 
= 0; 



(114) 



If ft G A c , 
- 2a* C 



P R (h) P2 \h 2 \ 2 
6(h) 



= o. 



a* Pi?.(Zl)p2|fe2| 2 
" In 2 6(h) +P R .(h)p 2 \h 2 \ 

1 P(ft)pi|fci| 2 
In 2 0(h) + P(h) Pl \h!\ 2 



The parameter a* is determined by the condition 

' P^' ) (h) Pl \h 1 \ 2 



+ E 



g(<**) 



(A)C 



00**) (/i) 
P« Q * ) (ft)p 2 |fe| 2 



+ E a 



p (c °(A)p3|M 



6<~ a *)(h) 



(115) 



The optimization for the upper bound ( 1102b can be per- 
formed using steps that are similar to those for the lower 
bound. In general, the lower bound dlOU and the upper bound 
( 11021 ) do not match. However, we show that if the channel 
statistics and the power constraints satisfy the following con- 
dition, the two bounds match and hence we obtain the capacity 
for this channel. 



Theorem 7: For the fading half-duplex relay channel Sce- 
nario III, if the channel statistics and the power constraints 
satisfy the condition ( II 101 >, then the capacity is given by 



C=E 




P {1) (h)Pi\hi\ 



P R 1] (h)p 2 \h 2 \ 



9W(h) 



(116) 



where the capacity achieving resource allocation 
(6^(h),P^(h),P^(h)) can be obtained iteratively 



from (fT07T> . (ITOl and dT09l 

The proof of Theorem [7j is similar to that of Theorem [5] 
and is omitted. 

Remark 6: The capacity in Theorem [7J refers to the largest 
rate under Scenario III that can be achieved over all pos- 
sible channel resource allocation 6(h) and power allocation 
(P(h),P R (h)). 

The condition given in Theorem [7j is similar to that in 
Theorem [6] for Scenario II, and these conditions tend to be 
satisfied either when the relay power P R is small compared 
to the source power P, or when the relay node is much closer 
to the source node than to the destination node. 
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Fig. 1 1 . Comparison of achievable rates with optimal resource allocations 
for Scenarios L II, and III 



In Fig. QT] we plot the achievable rates under Scenario III 
optimized over (0(h), P(h), Pr(H)) for the same Rayleigh 
fading relay channel as in Fig. [6] and Fig. [S] We compare 
these rates with the achievable rates under Scenario I opti- 
mized over (P(h), Pn(h)) and Scenario II optimized over 
(0, P(h), Pn(h)), and with the capacity of the direct link 
from the source node to the destination node. It is clear from 
the graph that employing the relay node greatly improves 
the performance of the source-to-destination channel. Fig. QT| 
shows that the achievable rate under scenario II is larger than 
the achievable rate under Scenario I, particularly when the 
relay power is large and the achievable rate under scenario 
I saturates. This demonstrates that using a jointly optimal 
channel resource allocation parameter 8 helps to improve the 
achievable rate. As we have commented for Fig. [9] Scenario 
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II does not have case 1, and hence the achievable rate under 
Scenario II continues to increase when the achievable rate 
under Scenario I saturates in case 1 . Furthermore, Scenario III 
has larger achievable rates than Scenario II because 9(h) can 
be dynamically changed based on the instantaneous channel 
state information. 

We note that finding an optimal resource allocation for 
Rayleigh fading relay channels is a high dimensional op- 
timization problem, particularly in Scenario III where the 
optimization is jointly over (0(h), P(h), PrQi))- Although 
the problem is convex, the standard convex programming 
techniques may converge slowly. However, since we have 
obtained the analytical structures of the optimal solutions, our 
numerical algorithm converges extremely fast and takes only 
a few iterations. 

VI. Concluding Remarks 

We have studied capacity bounds for the parallel relay 
channel and its special case of the fading relay channel. 
We have established capacity theorems for several classes of 
channels including the parallel relay channel with degraded 
subchannels and its Gaussian case, the full-duplex relay chan- 
nel that satisfies certain conditions in asynchronized case, and 
the half-duplex relay channel that satisfies certain conditions. 

We have studied resource allocation for the Gaussian par- 
allel relay channel with degraded subchannels and the fading 
relay channel under both full-duplex and half-duplex models. 
Our study of resource allocation is different from previous 
work on this topic in that we make the more practical 
assumption that the source and relay nodes are subject to 
separate power constraints rather than a total power constraint. 
We have shown that optimal resource allocation under this 
assumption may take three different forms depending on the 
channel statistics and values of the power constraints. 

Finally, we note that the resource allocation problem we 
have considered falls under a class of max-min problems and 
we have provided a technique for solving such max-min prob- 
lems. It is known that the achievable rates of relay channels 
when relay nodes use the decode-and-forward scheme are 
usually expressed by max-min forms. Our technique certainly 
applies to optimization problems arising in these contexts. In 
particular, our technique has been applied to study orthogonal 
relay broadcast channels in [40], and can be used to study 
more general classes of relay networks with fading links. 

Appendix I 

Proof of Resource Allocation that Maximizes C loy 
d78l ) for Scenario I 

We first let Ri(P(h), P R (h)) and R 2 (P(h), P R (h)) denote 
the two terms over which the minimization in (l78l l is taken. 
We can then express (F78b in the following compact form: 

Ciow = max 

(P(h),p R (h))eg (U7 ) 

mm{R 1 (P(li),P R (h)), R 2 (P(h),P R (h))} . 

We apply Proposition Q] to derive the optimal power alloca- 
tion rule, which falls into the following three cases. 



Case 1: a* = 0, and (P<W (ft), P R °> (h)) is an optimal power 
allocation, which needs to satisfy the condition 

Ri(P (0) (h),P R °\h)) > R 2 (P iQ) (h),P R 0) (h)). (118) 

By definition, (P {a) (h) , P R 0) (h)) maximizes 

R(0,P(h),PB.(h)) = R 2 (P(h),P R (h)). (119) 

The optimal P^(h) given in (|79l follows easily from the 
KKT condition. For case 1 to happen, (P {0) (h), P R °\h)) 
needs to satisfy the condition ( II 18b . It is clear that 
R2(P(h),PR(h)) depends only on P(h). The term 
Ri(P(h),P R (h)) depends on both P(h) and P R (h). 
To characterize the most general condition for case 1 to 
happen, P R (h) needs to maximize Ri(P(tk), P R (h)). Such 
P R ^ (h) can be obtained by the KKT condition and is given 
in (fSQb . The condition (JSTJ follows from equality of condition 

dm. 

Case 2: a* = 1, and (P^(h), P { R } (h)) is an optimal power 
allocation, which needs to satisfy the condition 

Ri(P {1) (h),P { R \h)) < R2(P {1) (h),P R 1} (h)). (120) 

The optimal (pW (h), P R 1] (h)) that maximizes 

R(l,P(h),P R (h)) = Ri(P(h),P R (h)) (121) 

can be easily obtained by the KKT condition, and are given 
in d82| i and ( f83l >. The condition d84"b follows from equality of 
condition (1 1 20b - 

Case 3: < a* < 1, and (P ia " } (h), pjf* 5 (h)) is an 
optimal power allocation, where a* is determined by the 
following condition 

Ri(P [a "Hh), P^^ (h)) = R2(P {a ' ) (h) 1 P R a '\h)). (122) 

We need to derive (P (a "\h), P R an (h)) that maximizes 

R(a\P(h),P R (h)) 

= «*Pi(P(/i),Pr(/i)) + (1 -a*)R 2 (P(h),P R (h)). 

(123) 

The Lagrangian can be written as 

C =a*E A [C (2P(h)p 1 \h 1 \ 2 )] + a*E [C (2P R (h)p 2 \h 2 \ 2 )} 
+ E A . [C(2P(/i) j0l |/ ? , 1 | 2 )] 
+ (1-«*)E A [C (2P(h) P3 \h 3 \ 2 )] 

- x(E[P(h)} - P) - n(E[P R Qkj\ - P R ) 

(124) 

where A and /i are Lagrange multipliers. 
For he A, the KKT condition is given by: 

dC 
dP(h) 

a* 1 1 - a* 1 

'^'^=k^TpJh) + ^2~' J -^ J + P(h) 

< A, with equality if P(h) > 

(125) 

It is easy to check that P^ a ">(h) for h e A given in ( f85t 
satisfies the preceding KKT condition. The p( Q )(h) for h S 
A c in ((851) and P R a * ] (h) in <|87]i also follow from the KKT 
condition. 
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Appendix II 

Proof of Resource Allocation that Maximizes Ci oh 
for Scenario II 



The Lagrangian can be written as 



We let Ri_(Q, P(h),P R (h)) and R 2 (9, P(h),P R {h)) denote 
the two terms over which the minimization in (l89l is taken. 
We can then express ( f89l > in the following compact form: 

Cioh = max 

0<9<i,(p(h),p R (h))eg 

min {i? a (0, P(h), P R (h)), R 2 {9, P(h), P R (h))} . 

(126) 



The max-min problem in ( 11261 ) can be solved by 
using Proposition Q] The main step is to obtain 

(#(«) ) p(«) (h^ p<>) (ft)) that maximizes 

R{a,6,P(h),P R {h)) 

:= aR^Piti^nih)) + (1 - a)R 2 (9,P(h),P R (h)) 

(127) 

for a given a. The following lemma states that maximizing 
the function R(a,9, P{h), P R {h)) over (9, P(h), P R (h)) is a 
convex programming problem and hence can be solved by 
standard convex programming algorithms. 

Lemma 2: For a fixed a, R(a,9, P(h), P R (h)) is a con- 
cave function over (9, P(h), P R (h)), where < 9 < 1 and 
{P(h),P R {h))&Q. 

Lemma [2] can be verified by computing the Hessian of the 
function R(a,9, P(h), P R (h)) (for a fixed a) and showing 
that it is negative semidefinite. 

We now apply Proposition [T] to study the max-min problem 
in (11261 ) by considering the following three cases. 

Case 1: a* = 0, and (6K°) , p(°) (h) , P R 0) (h)) is an optimal 
resource allocation, which needs to satisfy the condition 

Ri(8W,pW(h),P R °\h)) > R2(0^\P^(h),P R °\h)). 

(128) 

We first derive (9 {0) , P (a) (h) , P R 0) (hj) that maximizes 

R(0,9,P{h),P R {h)) = R 2 (9,P(h),P R (h)). (129) 

It is easy to see that the optimal 9^ — 1 from the 
expression of R 2 (9, P(h), P R (h)), and this results in 

Ri(8(°\pW(h),P R °\h)) < R 2 (9^\P^(h),P R °\h)). 

(130) 

Comparing ( 11281 ) and ( 11301 ), it is clear that only equality can 
be satisfied in ( 1 1 28b . Hence this case can be included in the 
following case 3 with a* being allowed to take the value of 
0. 

Case 2: a* = 1, and (<?W , P (1) (h), P { R ] (h)) is an optimal 
resource allocation, which needs to satisfy the condition 

fli^ 1 *,^ 15 ®,^ 1 '©) < P2(0 {1) ,P {1) (h),P R \tD)- 

(131) 

We first derive (9 (1 \ P (1) (h), P^ (h)) that maximizes 
R(l,9,P{h),P R (h))=R 1 (6,P(h),P R (h)). (132) 



C = 29E 



( P(h)Pi\hi 

V o 



29E 



P R (h)p2\h 2 \ 



\{E[P{h)]-P) - n[E[P R (h)] - P R 



It is easy to check that the KKT condition implies 

1 1 



P(h) 



P R (h) 



Aln2 pi|/ii| 2 
1 1 



/iln2 p 2 \h 2 



The KKT condition also implies that the optimal 
to satisfy the following condition: 



(133) 

(134) 

(135) 
W needs 




9 + P R (h) P2 \h 2 \ 



if 6» = 0; 
if < 9 < 1 
if 9 = 1. 



(does not happen) 
(does not happen) 



(136) 



where the first and third cases do not happen because ^ — > oo 
as 9 — > 0, and S§ — * — oo as 
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1. It can also be shown that 



^ is monotonically decreasing for < 6 < 1. Hence ^ has 
at most one root for < 9 < 1. 

The iterative algorithm described in (l9Tb-(l93T> converges 
to the solution of the KKT condition given in ( 1134l )-( fT36l ). 
Since the function R\(9, P(h), P R {h)) is concave, the solution 
of the KKT condition achieves the optimum. Condition d94l ) 
follows from condition d 1 3 1 b - 

Case 3: < a* < 1, and {9^'\P^"\h),P R a ' ] {h)) is 
an optimal resource allocation, where a* is determined by the 
following condition 



Ri{6( a '\P {a *Xli),P R a *\li)) 

= i? 2 (^ Q *),p( Q *)(/i),P^* ) (/i)). 



(137) 



We first derive (9<- a *\ P [a * ] (h), P R a '\h)) that maximizes 
R(a*,6,P(h),P R (h)) 

= ^R^O, P(h), P R (h)) + (1 - a*)R 2 (9, P(h), P R (h)). 

(138) 

for given a*. The Lagrangian can be written as 



C =2a*9E A 
+ 2a*9E 
+ 29E A c 



( PQk)pi\h x 

V o 

n (Pli{k)p 2 \h 2 



V o 

( P(h)pi\h 



|2\ 1 



(139) 



c f P(h)p 3 \) 



+ 2{l-a*)9E A 
- \(E[P(h)} -P)- p(v{P R (h)} - P R ^ 
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For a given 9, The optimal (P(h), P R (h)) given in d95l ) and 
(l97l ) follows from the KKT condition. The KKT condition also 
implies that the optimal 9 for a given (P(h), P R (h)) needs to 
satisfy the following condition 

dC 



= 2a*E A 



2a*E 



2E A c 



C 



( P R (h)p 2 \h 2 \ 
P(h)pi\hi\ 2 



P{h) Px \hi 



a ' 
ln2 



E 



In 2 A 



9 + P(h) Pl \hi\ 2 
P R (h) P 2\h 2 \ 2 
9~ + P R (k)p 2 \h 2 
P&piM 2 



9 + P(h) Pl \h l \' 



2(l-a*)E A 
1 



M 2 



In 2 

<0, 
= 0, 

>o, 



-E A 



P(h)p 



■3\h 3 \ 2 



-P(h)p 3 \h 3 \ 
if 9 = 0: 
if < 9 < 1; 
if 9 = 1. 



(does not happen) 
(does not happen) 



(140) 



where the first and third cases do not happen because ^ 
as 9 — > 0, and |§ — > — oo as — > 1. 

Therefore, the iterative algorithm described in 
converges to the solution of the KKT condition. Since the 
function R(a,9, P(h), P R Qi)) is concave for a given a, the 
solution of the KKT condition achieves the optimum. Condi- 
tion (f99t follows from condition ( 1137b . 

Appendix III 

Proof of Resource Allocation that Maximizes Ci ow 
dlOU for Scenario III 

We let Ri(9(h),P(h),P R (h)) and R 2 (9(h), P(h), P R (h)) 
denote the two terms over which the minimization in dlOU 
is taken. We can then express ( 110U in the following compact 
form: 

Ciow = max 

0<e(h)<l. V h 
{P(h),P R (h))eQ 

min {fli (6(h), P(h), P R (h)), R 2 (9(h), P(h),P R (h))} 

(141) 

As in Scenario II, one main step to solve the max-min 
problem in ( [T4"IT l is to obtain (9 {a ^ (h) , P^(h) , P R a) (h)) that 
maximizes 

R(a, 9(h), P{h), P R (K)) := aRi{9(h), P{h),P R (h)) 

+ (l-a)R 2 (9(h),P(h),P R (h)) 

(142) 

for a given a. The following lemma states that maximizing 
R(a,6(h),P(h),P R (h)) over (9(h), P(h), P R (h)) is a con- 
vex programming and hence can be solved by standard convex 
programming algorithms. 

Lemma 3: For a fixed a, R(a, 9(h), P(h), P R (h)) is a con- 
cave function over (9(h), P(h), P R (h)), where < 9(h) < 1 
for all h and (P(h), P R (h)) G Q. 



Lemma [3] can be verified by computing the Hessian of the 
function R(a, 9(h), P(h), P R (fi)) (for a fixed a) and showing 
that it is negative semidefinite. 

As for Scenarios I and II, we apply Proposition [T] to study 
the max-min problem in (I14U by considering the following 
three cases. 

Case 1: a* = 0, and (9^(h),P^(h),P^ ] (h)) is an opti- 
mal resource allocation, which needs to satisfy the condition 

Ri(e (a) (h),p (0 Hh),p^(h)) 



>R 2 (9 i0) (h),P^(h),P R 0) (h)) 



(143) 



We first derive (9^(h), P (a) (h), P R 0) (h)) that maximizes 

R(0, 9(h), P(h), P R (K)) = R2(9(h),P(h),P R (h)). (144) 

It is clear that 9^°'(h) given in ( 1105t is optimal from the 
expression of R2(9(fi), P(h), Pr(Il))- The power allocation 
P^(h) given in dl03b then easily follows from the KKT 
condition. 

For case 1 to happen, condition (1143b needs to be satisfied. 
To characterize the most general condition for case 1 to 
happen, for the given 9 ( -°\h) and P^°\h) P R (h) needs to 
maximize Ri(9^ 0) (h), P (0) (h), P R (h)), which has the follow- 
ing form: 



Ri(0 {o) (h),pW(h),P R (h)) 



= 2E 



{/llPtO)^) 



=0} [C (P R (h)p 2 \h 2 f 



(145) 



The optimal P ( R 0) (h) given in (11041 ) then follows from the KKT 
condition. Finally, condition ( 11061 ) follows from condition 

dm . 

The proofs for cases 2 and 3 are similar to those for Scenario 
II given in Appendix iHl and are omitted. 
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